Subsampling versus bootstrapping in resampling-based model selection for multivariable regression.

نویسندگان

  • Riccardo De Bin
  • Silke Janitza
  • Willi Sauerbrei
  • Anne-Laure Boulesteix
چکیده

In recent years, increasing attention has been devoted to the problem of the stability of multivariable regression models, understood as the resistance of the model to small changes in the data on which it has been fitted. Resampling techniques, mainly based on the bootstrap, have been developed to address this issue. In particular, the approaches based on the idea of "inclusion frequency" consider the repeated implementation of a variable selection procedure, for example backward elimination, on several bootstrap samples. The analysis of the variables selected in each iteration provides useful information on the model stability and on the variables' importance. Recent findings, nevertheless, show possible pitfalls in the use of the bootstrap, and alternatives such as subsampling have begun to be taken into consideration in the literature. Using model selection frequencies and variable inclusion frequencies, we empirically compare these two different resampling techniques, investigating the effect of their use in selected classical model selection procedures for multivariable regression. We conduct our investigations by analyzing two real data examples and by performing a simulation study. Our results reveal some advantages in using a subsampling technique rather than the bootstrap in this context.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Alternative Bootstrap to Moving Blocks for Time Series Regression Models

The purpose of this paper is to introduce and examine two alternative, although similar, approaches to the Moving Blocks and subsampling Bootstraps to bootstrapping the estimator of the parameters for time series regression models. More specifically, the first bootstrap is based on resampling from the normalised discrete Fourier transform of the residuals of the model, whereas the second is fro...

متن کامل

Resampling Methods for Meta-Model Validation with Recommendations for Evolutionary Computation

Meta-modeling has become a crucial tool in solving expensive optimization problems. Much of the work in the past has focused on finding a good regression method to model the fitness function. Examples include classical linear regression, splines, neural networks, Kriging and support vector regression. This paper specifically draws attention to the fact that assessing model accuracy is a crucial...

متن کامل

Evaluating Variance of the Model Credibility Index

Model credibility index is defined to be a sample size under which the power of rejection equals 0.5. It applies goodness-of-fit testing thinking and uses a one-number summary statistic as an assessment tool in a false model world. The estimation of the model credibility index involves a bootstrap resampling technique. To assess the consistency of the estimator of model credibility index, we in...

متن کامل

The Impact of Bootstrap Methods on Time Series Analysis

Sparked by Efron’s seminal paper, the decade of the 1980s was a period of active research on bootstrap methods for independent data— mainly i.i.d. or regression set-ups. By contrast, in the 1990s much research was directed towards resampling dependent data, for example, time series and random fields. Consequently, the availability of valid nonparametric inference procedures based on resampling ...

متن کامل

Comments on: Control of the false discovery rate under dependence using the bootstrap and subsampling

In this enlightening and stimulating paper, Professors Romano, Shaikh, and Wolf construct two novel resampling-based multiple testing methods using the bootstrap and subsampling techniques and theoretically prove that these methods approximately control the FDR under weak regularity conditions. The theoretical results provide a satisfactory solution to an important and challenging problem in mu...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Biometrics

دوره 72 1  شماره 

صفحات  -

تاریخ انتشار 2016